Ensemble-based multimedia asset recommendation system

ABSTRACT

An asset to be displayed during an online engagement is identified. A mixture of multimedia assets that include any combination of: audio, video, text, or image, are selected to be displayed simultaneously with the asset. Each multimedia asset selected based at least on a relationship between the multimedia asset and the asset to be displayed during the online engagement. A presentation screen is created that includes the selected multimedia assets and the asset to be displayed during the online engagement based on display parameters of a receiving device.

CROSS-REFERENCE TO RELATED APPLICATIONS; PRIORITY CLAIM

This application is a Continuation of U.S. patent application Ser. No. 14/811,803, filed Jul. 28, 2015, which claims benefit of U.S. Provisional Patent Application No. 62/029,713, filed Jul. 28, 2014, the entire contents of which is incorporated by reference as if fully set forth herein under 35 U.S.C. § 120.

TECHNICAL FIELD

The present disclosure generally relates to the automatic selection of multimedia content.

BACKGROUND

The approaches described in this section are approaches that could be pursued, but not necessarily approaches that have been previously conceived or pursued. Therefore, unless otherwise indicated, it should not be assumed that any of the approaches described in this section qualify as prior art merely by virtue of their inclusion in this section.

Content suggestion systems traditionally select content based on a relationship to a user's preferences or a relationship to parameters of a search query. Recommendation engines use keyword or tags in order to select content that match a particular content selected by a user. These systems have a singular function that is focused on presenting a set of content that is similar in type, e.g., video, to the content being accessed by the user. Further, these systems typically present the suggested content to the user in a list format. Content providers lack a mechanism for getting assets from their mixed-multimedia asset libraries in front of users and to present the mixed-multimedia assets to users in meaningful and engaging ways.

BRIEF DESCRIPTION OF THE DRAWINGS

In the drawings:

FIG. 1 illustrates block diagram of an asset portfolio manager, according to an embodiment of the invention;

FIG. 2 illustrates an example layout of an ensemble of electronic assets, according to an embodiment of the invention;

FIG. 3 illustrates an example of pairwise comparison of assets, according to an embodiment of the invention;

FIG. 4 illustrates an example of feature values for three assets according to an embodiment of the invention;

FIG. 5 illustrates an example of the absolute value of differences of assets for feature 1 of the example data, according to an embodiment of the invention;

FIG. 6 illustrates an example of the absolute value of differences of assets for feature 2 of the example data, according to an embodiment of the invention;

FIG. 7 illustrates an example of the absolute value of differences of assets for feature 3 of the example data, according to an embodiment of the invention;

FIG. 8 illustrates an example of the weighted composite differences of assets for feature 1, 2, and 3, of the example data, according to an embodiment of the invention;

FIG. 9 illustrates an example of the weighted asset-to-asset similarities of each pair of assets in the example data, according to an embodiment of the invention;

FIG. 10 illustrates an example display layout, according to an embodiment of the invention;

FIG. 11 illustrates an example of how the adjusted similarity matrix is computed, according to an embodiment of the invention;

FIG. 12 illustrates an example matching assets with section and slot requirements, according to an embodiment of the invention;

FIG. 13 illustrates an example system block diagram, according to an embodiment of the invention; and

FIG. 14 illustrates a computer system upon which an embodiment may be implemented.

DETAILED DESCRIPTION

In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, that the present invention may be practiced without these specific details. In other instances, well-known structures and devices are shown in block diagram form in order to avoid unnecessarily obscuring the present invention.

Embodiments are described herein according to the following outline:

-   -   1.0. General Overview     -   2.0. Methodology     -   2.1. Ensemble Strength     -   2.2. Ensemble Duplication     -   2.3. Ensemble Depth     -   2.4. Construction of Ensembles     -   2.4.1. Similarity Matrix Preparation for Ensemble Construction     -   2.4.2. Weak Ensemble Construction     -   2.4.3. Strong Ensemble Construction     -   2.4.4. Semi-strong Ensemble Construction     -   2.4.5. Improved Semi-strong Ensemble Construction     -   2.5 Layout Comparison     -   3.0 Example Embodiments     -   4.0 Implementation Mechanisms-Hardware Overview     -   5.0. Extensions and Alternatives

1.0. GENERAL OVERVIEW

This overview presents a basic description of some aspects of a possible embodiment of the present invention. It should be noted that this overview is not an extensive or exhaustive summary of aspects of the possible embodiment. Moreover, it should be noted that this overview is not intended to be understood as identifying any particularly significant aspects or elements of the possible embodiment, nor as delineating any scope of the possible embodiment in particular, nor the invention in general. This overview merely presents some concepts that relate to the example possible embodiment in a condensed and simplified format, and should be understood as merely a conceptual prelude to a more detailed description of example possible embodiments that follows below.

Disclosed herein are systems and methods for pairing assets for online presentation where the plurality of electronic assets are available in a wide range of formats. The processes associated with selection and ordering of these assets to be made available for consumption in an online environment is increasingly becoming more data-driven and requiring that electronically-stored assets of different formats be presented as an ensemble or collection. A non-exhaustive list of asset types include text (TXT, HTML, etc.), images (GIF, JPG, TIFF, BMP, PNG, etc.), videos (MP4, WMV, FLV, etc.), geographic (SHP, KML, etc.), coded (JavaScript, PHP, etc.), and sound (MP3, WMV, QT, etc.) files. The asset itself is a distinct electronic file that may be presented to a user for consumption either by itself or simultaneously with other electronic files.

For each asset type, both the storage/presentation requirements and resulting consumption of the assets may differ and require cognitive processing by using one or more of the human user's five senses: sound, sight, touch, taste, and smell. An engagement is the timeframe during which a user is consuming the electronic asset using one or more of the five human senses. Presentation of information to the human user may appeal to one or more of these senses and there exists a business purpose for pairing files of the various types to increase the user's satisfaction with their consumption and thereby allowing the asset portfolio manager (the individual responsible for deciding upon the assets to display) to yields some desirable benefit from a satisfactory consumption of the assets. The concept of pairing assets is not restricted to two distinct assets—rather, a collection of assets (containing at least two assets) is selected for presentation during a single engagement by a user. An asset may be categorized as stationary or non-stationary. A stationary asset will not change over time—examples include an image, a non-interactive electronic map, or a sound file with a single tone (“ding”, “beep”, etc.). Non-stationary assets are electronic files that vary over time such as a video, an animated image, a song, or an interactive map. The changes over time that cause an electronic asset to be considered non-stationary can be a) executed automatically when an appropriate application exists that allows the temporal variations to be presented; b) triggered according to the internal clock of the device—where available; or c) initiated interactively by the electronic assets' consumer during an engagement.

Management of an asset collection may be conducted by the owner of the collection or another party given authorization to generate recommendations and pursue revenue acquisition on behalf of the owner. This entity may also be referred to as the “asset portfolio manager”. Referring to FIG. 1 , the asset portfolio manager 101 may be software and/or hardware resident in a server, client computer, distributed cloud computing system, virtual machine, etc. An embodiment performs a methodology for pairing multimedia assets in a single presentation via an ensemble of assets with the intent of enriching and lengthening the engagement experience to allow a higher level of value to be achieved by the content provider Value may be based on an objective function determined by the asset portfolio manager and may include, but is not limited to: a) maximization of revenue b) user satisfaction c) recurrence of visitors. Allowing the user to more deeply explore related assets in a variety of different formats, in perhaps more detail, and/or by appealing to various sensory modes is assumed to lead to a preferred engagement for the user. Over time, the rotation of different number, placement, and types of assets can be used to determine which presentations yield higher revenues. The system therefore not only enriches the user experience through a co-presentation of a collection of multi-sensory and multimedia assets, but provides a mechanism for testing various layouts to yield a value outcome that is perhaps more favorable to the owner/managers of the displayable content.

During an online engagement where a recommendation system algorithm 102 considers candidate assets stored in the asset library 103 and available for presentation and consumption by a user, there often arises the desire to use the available device presentation capabilities to present more than one asset at the same time in the shared display space. The asset portfolio manager may be interested in simultaneous co-presentation of a set of assets related to the online asset 104 to enhance the viewing experience and provide a means of presenting the content in a way that appeals to the multi-sensory capabilities of human consumers. The asset portfolio manager—which has access to a library of multimedia assets 103 available to be displayed (either remotely over a network, such as an intranet, internet, etc., or locally over a local network of connection)—automatically selects which assets are most appropriate for co-display given their preferences for content delivery 105. The selected assets may be sent by the asset portfolio manager to the receiving device, or a display screen, web page, etc., may be created that includes the selected assets and the assets that are to be displayed online during the online engagement 105. Alternatively, the asset portfolio manager may display a presentation screen template to an administrator and suggest the placement of each of the selected assets within the presentation screen template. The administrator may indicate the placement of each selected asset and instruct the asset portfolio manager to send the completed presentation screen to a receiving device. In an embodiment, the selected assets may be homogeneous in nature, e.g., video only, etc. In another embodiment, the selected assets may be heterogeneous and may comprise any combination of: text, video, hyperlinks, images, audio, etc. To meet this need, the system performs several features:

-   -   1. Definition of an anchor asset—one that may be presented with         certainty during the online engagement     -   2. Selection of assets for co-presentation that can be         simultaneously delivered in the same user engagement session     -   3. Consideration of the user's human sense(s) to which the asset         portfolio manager wishes to be present in the online environment     -   4. Selection of assets for presentation that remain fixed or may         vary during the engagement (such as a fixed image or a video)         according to the asset manager's choice     -   5. Enforcement of a rule set that ensures the probability of the         selection of assets to vary based on the preferences of the         asset portfolio manager and physical constraints on the display         device     -   6. An automated system that, given the constraints and         preferences set by the asset portfolio manager, selects an         ensemble of assets for co-presentation that share a similar         theme     -   7. A mechanism for down-weighting an asset so that it has a         reduced chance of being selected for display     -   8. A process by which various presentation alternatives (in         terms of the number and type of assets, the similarity among the         assets, and the physical layout of the presentation of assets in         an online environment) can be explored and measured to determine         the presentation format that yields the most value to the user.

One application of an embodiment can be found in the online news publishing industry. For any given asset (e.g., textual, such as a news article, etc.) that can be presented on a web page, there may arise the desire of the asset portfolio manager 101 to also co-present relevant images, videos, sound clips, and hyperlinks to related articles to provide additional information in a variety of format with the intent of improving the satisfaction associated with the consumer's engagement with the content. Referring to FIG. 2 , an example of a presentation of multimedia assets 200 is shown where a news article 201 (a text asset) is shown in the center of a display and accompanied by a variety of assets 202 (audio, image, video, advertisements, links to other articles, etc.) in the ensemble. A successful presentation would deliver appropriate assets that consider the user's various sensory modes to which the asset portfolio manager 101 wishes to appeal (e.g., given predetermined limits on content and the physical constraints on the display device, etc.) while ensuring the ensemble of presented assets is, as a group, relevant to the article that has been selected to be displayed with certainty (the anchor asset).

An embodiment displays or creates a display of video asset recommendations to users engaging a variety of online websites using mobile and non-mobile electronic data providers. The system:

-   -   a. Creates recommendations to a user based on an asset (e.g.,         video, audio, text, etc.) library     -   b. Ensures that content in an asset (e.g., video, audio, text,         etc.) portfolio is monetized     -   c. Delivers analytics to learn the most valuable content in a         client's video library, for example.     -   d. By operating as a supply-side platform (SSP) for online         video, the system utilizes data to increase engagement and         provide solutions to assure that advertisements are delivered         and advertisement rates are optimized.

Inputs: The inputs to the system may be:

-   -   L_(t) is the physical arrangement of assets in presentation t         limited by the display parameters of the presentation device,         such as the physical characteristics of the presentation device         and the embedded software that enables certain asset types to be         consumed (for example, presentation of a video file in the         layout may require that there exists a capability for the video         to be played, if desired by the user during engagement). FIG. 2         , shown earlier, is an example of a possible physical         arrangement of assets.

A is the set of portfolio assets available for selection and subsequent display. Individual asset a_(i) is the i asset among n assets: A={a₁, a₂, . . . , a_(n)}

a_(i) is the anchor asset that can be presented with certainty and that is capable of being presented given the physical characteristics and the other limitations of L_(t).

A^((−i)) is the remainder of the portfolio assets that may be selected for co-presentation along with the anchor asset, a_(i): A=A^((−i))∪a_(i), A^((−i))∩a_(i)=Ø

X is a set of k features describing each of the n assets.

X={x ₁ ,x ₂ , . . . , x _(n)} where x _(i) ={x _(i,1) ,x _(i,1) , . . . , x _(i,k)}.

τ_(i,c) is a subset of a feature of asset a_(i) referred to as a collection descriptor which is a label or index indicating a theme under which the asset is grouped (e.g., sports, entertainment, justice, etc.). An asset a_(i) may belong to c≥1 asset groups so that τ_(i)={τ_(i,1), τ_(i,2), . . . , τ_(i,c)}. Additionally, more refined theme division may exist describing a physical locale and more detailed membership requirements. For simplicity, each distinct collection label, sub-collection criteria and locale can be uniquely identified using a collectionID as shown in Table 1. Note that the entry corresponding to collectionID=3 has a locale of “Worldwide” which is equivalent to describing an asset whose relevance is without geographic limits.

ssfID_(i) is a single feature of asset a_(i) indicating three characteristics of the asset; the human sensory mode to which it appeals, whether the asset is stationary or non-stationary, and a description of the asset type. Table 2 gives an example of some instances of various ssfIDs where the plurality of ssfIDs has a size of P.

TABLE 1 CollectionID Example Encompassing Distinct Locale/Theme Combinations collectionID collectionLabel collectionDetail Locale 1 Sports Football United States 2 Sports Soccer Europe 3 Politics Congress United States 4 Entertainment Celebrities Worldwide 5 Entertainment Movie Reviews United States 6 Justice Crime Illinois USA 7 Justice Legislation Illinois USA . . . . . . . . . . . . C . . . . . . . . .

TABLE 2 ssfID Examples Describing Distinct Sense/Stationarity/Description Triplets ssfID Sense isStationary fileType 1 Sight Yes Image 2 Sight No Map 3 Sight No Animated GIF 4 Sight. No Video . . . Sight . . . . . . Sound Yes Tone Sound No Music Sound No Audio Clip Sound . . . . . . p . . . . . . . . .

κ_(i,c) is a text-based subset of a feature of asset a_(i) referred to as a keyword list which is set of labels describing asset a_(i). Asset ai a_(i) may have w≥0 keywords so that κ_(i)={κ_(i,1), κ_(i,2), . . . , κ_(i,w)}. Keywords may be in any language and contain special characters and symbols.

FileTypeID_(i) is the type of file (based on the file extension associated with asset paired with a ssfID). Examples of various file types that may be associated with each distinct ssfID are shown as a many-to-one relationship in Table 3. Included in this table is a value for each row called additiveValue (always greater than or equal to 0) that can be used to induce a preference for certain file types sharing the same ssfID.

Larger values associated with additiveValue increase the chances that, among all files selected for co-presentation with the anchor asset a_(i), the file having the specified type will be selected.

TABLE 3 File Types Associated with Various ssfIDs FileTypeID ssfID fileExtension additiveValue 1 1 GIF 3 2 1 TIFF 2 3 1 JPG 1 4 1 JPEG 1 5 2 KML 1 6 3 GIF 1 7 4 MP4 1 . . . . . . . . . . . . f

x_(i) as described earlier contains k features describing asset a_(i);x_(i)={x_(i,1), x_(i,1), . . . , x_(i,k)}representing the asset's;

-   -   1) collectionID list, τ_(i)     -   2) distinct sense/stationarity/description triplet denoted by         ssfID_(i)     -   3) keyword list, κ_(i)     -   4) All of the remaining k−3 features describing asset a_(i).

An example of the n×k table use to store the various sets of k features for each of the n assets in A is shown in Table 4. In addition to the k features, the stored table may include an assetID i=1, . . . , n along with a FileTypeID.

TABLE 4 k-dimensional Feature Set for n Assets Features Asset FileTypeID collectionID(s) ssfID Keywords field_4 field_5 . . . field_k 1 4 1, 4 1 New England Patriots. Tom Brady. AFC, Touchdown 2 4 1 1 Jasonville Jaguars. Jordan Todman. Kickoff return 3 1 4 3 Humphrey Bogart Casablanca, Oscar Award . . . . . . . . . n

s_(i,j) is a computed similarity between asset a_(i) and asset a_(j) based on pairwise difference between the features of the two assets, not including the collectionID(s), ssfID, or keyword(s). The computation of this pairwise similarity may represent a more complex metric that is employed by the underlying recommendation engine at the disposal of the asset portfolio manager. In this example, a simple Minkowski metric (a computation of pairwise differences between asset a_(i) and itself represented by s_(i,i) typically yields a value of 1 (after standardization) and the matrix S is symmetric and non-negative for all i,j) is discussed that may be applied after all of the features have been standardized to a [0,1] scale.

${s_{i,j} = {{1 - {\left\{ {\sum\limits_{m = 1}^{k}{w_{m}\left( {x_{i,m} - x_{j,m}} \right)}^{Y}} \right\}^{1/Y}0}} \leq w_{m} \leq 1}},{\sum\limits_{m = 1}^{k}w_{m}},{Y > 0}$

The n×n matrix of pairwise similarities may be computed (independent of the ssfID, FileTypeID, and collectionIDs) in order for the co-presentation process to be employed. FIG. 3 illustrates the identification of similarities between pairs of assets.

The computed pairwise similarities may then be represented as:

$S = {\begin{bmatrix} 1 & s_{1,2} & \ldots & s_{1,{n - 1}} & s_{1,n} \\ s_{2,1} & 1 & \ldots & s_{2,{n - 1}} & s_{2,n} \\ \ldots & \ldots & \ldots & \ldots & \ldots \\ s_{{n - 1},1} & s_{{n - 1},2} & \ldots & 1 & s_{{n - 1},n} \\ s_{n,1} & s_{n,2} & \ldots & s_{n,{n - 1}} & 1 \end{bmatrix}.}$

To illustrate how the similarities are computed, FIGS. 4-9 provide an example of three assets {a_(i), a_(j), a_(k)} and the observed k=3 features values for each asset. FIG. 4 shows the values of the assets. FIGS. 5-7 show the absolute values of pairwise differences among the three assets {a_(i), a_(j), a_(k)} in the example data for features 1, 2, and 3, respectively. Using an exponent −1 and weights of w={w₁, w₂, w₃}={0.40, 0.30, 0.10} for features 1, 2, and 3, respectively, then FIG. 8 shows the composite weighted values of the three features for each pair of assets. From these values, the composite similarities for the example set of assets is shown in FIG. 9 .

α_(i,j) a multiplier selected by the asset portfolio manager indicating if certain pairings of assets should have a reduced chance of being co-presented where 0≤α_(i,j)<1 and, for any pair i,j not defined, it is assumed that α_(i,j)=1. Table 5 provides an example of some pairs of assets for which co-presentation is discouraged and for any value for which α_(i,j)=0, co-presentation may not be allowed.

TABLE 5 Down-weighting of Asset Pairs i j α_(i,j) 1 2 0.00 1 4 0.10 3 1 0.12 4 2 0.58 . . . . . . . . . q . . . . . .

The pairwise similarity matrix S along with Tables 1-5 represent the schema that can be used to build an ensemble of assets for presentation—given a_(i) (the anchor asset)—once the asset portfolio manager has designed the physical arrangement of asset for presentation, L_(t).

2.0 METHODOLOGY

With all of the inputs in place, the item portfolio manager may design the layout L_(t) and the underlying computations associated with an embodiment can be executed to automatically populate the presentation. FIG. 10 illustrates a sample layout, L_(o), which considers the physical layout of the presentation space available given the physical constraints of the device used to present the selected assets. Note that the layout region, B, can be subdivided in d=l, . . . , D sections, each having an anchor asset, a_(i) ^((d)). In each of the D sections, the item portfolio manager may then identify the particular ssfID and position where they would like to place each asset to be co-presented. In each slot r=1, . . . , R^((d))—the position where an asset is to be placed in a section of the presentation—an ssfID is defined by the asset portfolio manager.

2.1 Ensemble Strength

Within a section d, the entire set of co-presented assets in the R_(d) can be called an ensemble,

E _(d) ={a _(i) ^((d)) ,a _(j) ^((d)) , . . . , a _(R) ^((d))}.

In an embodiment, there may be four levels of ensemble strength:

-   -   1) Weak Ensemble: within a section d, the selection for assets         to be included for co-presentation are based on similarities         among every candidate asset in A^((−i)) and the anchor asset,         a_(i). At most, the number of pairwise comparisons of eligible         assets that needs to be completed is n−1 and so this a highly         computationally efficient approach to building an ensemble but         may lead to solutions where the elements selected from A^((−i))         are not very similar to each other.     -   2) Strong Ensemble: within a section d, the selection for assets         to be included for co-presentation are based not only on         similarities among every candidate asset in A^((−i)) and the         anchor asset, a_(i), but also every other candidate asset in         A^((−i)). If there are n_(r) ^((d)) eligible candidate assets to         fill slot r in section d, then the maximum number of possible         solutions to the problem of selecting the r=1, . . . , R_(d)         assets (disallowing duplication of the same asset within a         section) is:

$N_{d} = {\frac{\left( {n - 1} \right)!}{\left( {n - R_{d} - 1} \right)!}.}$

-   -    Note that, as n grows large, N_(d)→∞, making the strong         ensemble method computationally intractable in many cases.     -   3) Semi-strong Ensemble: Since the semi-strong ensemble is         computationally costly and the weak ensemble may result in a         solution where the assets from A^((−i)) that are chosen for         co-presentation with a_(i) are dissimilar, a compromise solution         is offered. Construction of a semi-strong ensemble is done         iteratively over each of the r=1, . . . , R_(d) slots in a         section d. In the initial asset selection the first slot (r=1)         is filled with an asset, a_(j), that is selected based on its         similarity with a_(i). In the next step, slot r=2 is filled with         asset a_(k) based on its similarity with both a_(i) and a_(j).         This process repeats over the R_(d) slots, each time comparing         candidate assets to those that have been selected for inclusion         in the section in previous iterations. Essentially, the ensemble         is “backward-looking” and adds an asset to a slot based on the         similarities with all assets in slots that have already been         filled—but does not consider the similarities of assets that may         be added in the filling of future slots. For small values of         R_(d) and n, it is possible that the strong, semi-strong, and         weak ensembles may be identical, though such an outcome is not         expected. However, the upper bound on the total number of         required comparisons for a given section d is no greater than

$\frac{{nR}_{d}\left( {R_{d} + 1} \right)}{2},$

-   -    a lesser computational burden (for larger n) than required         build a strong ensemble.     -   4) Improved Semi-strong Ensemble: After a semi-strong ensemble         is built, a second round of iterations can be applied that seek         to find improvements to the objective function. This is, of         course, more computationally costly than the semi-strong         ensemble method, but the additional cost is proportional to         (n′−2D) additional computations where n′ is the total number of         slots to be filled in the display L_(t) and D is the total         number of sections. The option to pursue an improved ensemble         using this construction method can be considered by the asset         portfolio manager based on a comparison of the tradeoff between         a desire to seek a solution that is: a) more consistent with a         final layout that may be achieved using the strong ensemble         method; and b) does not add an unacceptable computational         burden.

2.2 Ensemble Duplication

Within an ensemble, a requirement prohibiting or allowing duplicate assets within or across a section can also be defined. Where there always exists the requirement that the candidate assets for inclusion in the presentation of a section-level set of assets not duplicate the anchor asset a_(i)—that is A=A^((−i) ∪a) _(i), A^((−i))∩a_(i)=Ø—duplication may be allowed among the other non-anchor assets in the presentation of section d.

-   -   1. When local duplication is prohibited, no two assets within a         section d may be the same.     -   2. When global duplication is prohibited, no two assets within a         section d may be the same nor may any two assets across the d=1,         . . . , D sections be allowed in the layout, L_(t).

2.3 Ensemble Depth

Another consideration when building an ensemble is the degree to which the similarities among the assets in the d=1, . . . , D sections should be. The enforcement of similarity across sections in a layout L_(t) as the depth—depth can either be local or global. If the ensemble depth is local, then there is no requirement that the similarities across assets presented in different sections be considered in the selection of assets of L_(t). If the depth is global, pairwise comparisons of assets are computed both within a section d and across all sections d=1, . . . , D. Notationally, the similarities among assets is s_(i,j), but when further transformations are applied to adjust for commonality among keywords, file types, and enforce ssfID and down-weighting requirements, then similarities in section d among two its members, a^((d)) _(i) and a^((d)) _(j), can be denoted by f^((d)) _(i,j) and the vector of assets selected for presentation in section d is:

a ^((d)) ={a _(i) ^((d)) ,a _(j) ^((d)) , . . . , a _(R) _(d) ^((d))}

The application of depth may be accomplished by taking an overall representative of similarities of assets with a section d and the overall similarity of assets across the various sections d=1, . . . , D. Let;

$w^{(d)} = \left\{ {{\begin{matrix} {\min\limits_{j}f_{i,j}^{(d)}} & {{{{if}{weak}{ensembles}{are}{desired}j} = {1\ldots R_{d}}},{i \neq j}} \\ {\min\limits_{j,k}f_{j,k}^{(d)}} & {{{otherwise}j},{k = {1\ldots R_{d}}},{j \neq k}} \end{matrix}b^{(d)}} = \left\{ \begin{matrix} {\min\limits_{j}\left( {f_{i,j}^{(d)},f_{i,j}^{(g)}} \right)} & {{{{if}{weak}{ensembles}{are}{desired}j} = {1\ldots R_{d}}},{i \neq j},{g \neq d}} \\ {{\min_{j,k}f_{j,k}^{(d)}},f_{j,k}^{(g)}} & {{{otherwise}j},{k = {1\ldots R_{d}}},{j \neq k},{g \neq d}} \end{matrix} \right.} \right.$

-   -   and let the measure of depth across all d=1, . . . , D sections         in L_(t) be

m(L _(t))=w ^((d))+λ_(t) b ^((d)) where 0≤λ_(t)≤1.  {Equation 1}

Selection of assets for presentation within a layout, L_(t), is then equivalent to maximizing m(L_(t));

$\lambda_{t} = \left\{ {\begin{matrix} 0 & {{when}{local}{depth}{is}{desired}} \\ 1 & {{when}{global}{depth}{is}{desired}} \end{matrix}.} \right.$

2.4 Construction of Ensembles

In this section, the mechanics of the construction of weak, strong, and semi-strong ensembles is presented along with the steps to apply local or global duplication prohibition and local/global depth restrictions. Among all three methods of constructing ensembles, there are common steps to prepare the similarity matrix based on the availability of theme and keyword features possessed by each asset along with the down-weighting of certain asset pairs made to discourage co-presentation in a display.

2.4.1 Similarity Matrix Preparation for Ensemble Construction

Step 1a: each element in the diagonal of matrix S is set equal 0 to yield a matrix of similarities;

$S = \begin{bmatrix} 0 & s_{1,2} & \ldots & s_{1,{n - 1}} & s_{1,n} \\ s_{2,1} & 0 & \ldots & s_{2,{n - 1}} & s_{2,n} \\ \ldots & \ldots & \ldots & \ldots & \ldots \\ s_{{n - 1},1} & s_{{n - 1},2} & \ldots & 0 & s_{{n - 1},n} \\ s_{n,1} & s_{n,2} & \ldots & s_{n,{n - 1}} & 0 \end{bmatrix}$

This is equivalent to defining A^((−i)) since asset a_(i) has been removed from the set of candidate assets for co-display (avoiding duplication of the anchor asset) since one requirement for selection will be that the similarity value, s_(i,j), of a selected item a_(j) to be paired with the anchor asset, a_(j), is that s_(i,j)>0.

Step 1b: Identify entries in Table 5 to allow adjustment for the chance of being selected (lower values of s_(i,j) reduce the chances of selection for presentation). Undefined pairs for any i,j receive a value of α_(i,j)=1.

Step 1c: For each pair of assets {a_(i), a_(j)} of A (i,j=1, . . . , n) a multiplier, δ_(ij), is computed;

$\delta_{i,j} = {{\frac{w_{i}\bigcap w_{j}}{\max\left( {w_{i},w_{j}} \right)}0} \leq \delta_{i,j} \leq 1}$

-   -   where w_(i) is the number of keywords associated with a_(i), w         is the number of keywords associated with a_(j), and w_(i)∩w_(j)         is the number of keywords the two assets have in common.

Step 1d: For each pair of assets {a_(i), a_(j)} of A (i,j=1, . . . , n) a multiplier, v_(j,k), is computed;

$V_{i,j} = \left\{ {{\begin{matrix} 1 & {{if}{asset}i{shares}a{collection}{ID}{with}{asset}j} \\ \epsilon_{v} & {otherwise} \end{matrix}\epsilon_{v}} \geq 0.} \right.$

Step 2: Update the values of S by multiplying against the values computed in Step 1a-1c to adjust the similarities based on the relative concordance of keywords (lower concordance reduces the chances of selection for presentation), consistency of themes, and

s′ _(i,j) =s _(i,j)×δ_(i,j) ×v _(i,j)×α_(i,j)

The similarity matrix prepared for ensemble construction in then an adjusted n×n symmetric matrix, S′, representing all pair of assets {a_(i), a_(j)} of A (i,j=1 . . . , n).

$S^{\prime} = \begin{bmatrix} 0 & s_{1,2}^{\prime} & \ldots & s_{1,{n - 1}}^{\prime} & s_{1,n}^{\prime} \\ s_{2,1}^{\prime} & 0 & \ldots & s_{2,{n - 1}}^{\prime} & s_{2,n}^{\prime} \\ \ldots & \ldots & \ldots & \ldots & \ldots \\ s_{{n - 1},1}^{\prime} & s_{{n - 1},2}^{\prime} & \ldots & 0 & s_{{n - 1},n}^{\prime} \\ s_{n,1}^{\prime} & s_{n,2}^{\prime} & \ldots & s_{n,{n - 1}}^{\prime} & 0 \end{bmatrix}$

FIG. 11 depicts an example of how the adjusted similarity matrix is computed, specifically showing the process for two pairs of example assets {a₁, a₄} and {a₁,a₇}.

Step 3: For each element section d, slot r, and element s′_(i,j) of S′, three sub-steps are executed:

Step 3.a: a multiplier is computed, ϕ_(j) ^((d,r)), as

$\phi_{j}^{({d,r})} = \left\{ \begin{matrix} 1 & {{{if}a_{j}{has}{the}{ssfID}{required}{for}{slot}r},{{section}d}} \\ 0 & {otherwise} \end{matrix} \right.$

This step forces the asset selection process to prefer those assets that are consistent with the sense/stationarity/fileType specified by the asset portfolio manager. Setting a value of ε₀>0 for non-matched ssfID values is a possible less-strict option that can be employed to allow a selection not exactly specified by the asset portfolio manager.

Step 3.b: the additiveValue, ρ_(j), is extracted from Table 3 (the various file types associated with each ssfID) for each asset S′ based on a lookup of the fileTypeID found in Table 4 (the features of each asset including the file type). the purpose of this sub-step is to adjust the similarity values to prefer, among all assets that remain eligible for selection, the asset having the desired file type. FIG. 12 shows an example of how the ssfID and fileTypeIDs required for a slot r and section d are computed—matches are shown for eligible candidates for section 1

Step 3.c: Compute a final set of similarity pair for each section d each slot r as:

f _(i,j) ^((d,r)) =s′ _(i,j)×ϕ_(i,j) ^((d,r))×(1+ρ_(j) ^((d,r)))

Step 4: Among all similarity matrices, f_(j,k) ^((d,r)) where d=1, . . . D and r=1, . . . , R_(d), select the values such that f_(j,k) ^((d,r))>0 as initial candidates for that slot r and section d. To be eligible to fill a slot in section d, an asset a_(j) ^((d)), can have a non-zero similarity value (s′_(i,j)>0) when paired with the anchor asset a_(i) ^((d)) in that section. Note that it is disallowed for another asset to populate the slot(s) reserved for the anchor asset in each section.

The resulting candidate list may then be used to create a table F_(t) containing a list of possible assets that may be used to populate the display L_(t) for all sections d=1, . . . , D and their respective slots r=1, . . . , R_(d).

An example table Ft from preparation of the similarities is shown in Table 6. In the example, section d=1 has R_(d)=2 slots; one is reserved for the anchor asset (a₁ and a₂) and eligible anchor assets are based on the example in FIG. 12 . while the second slot may be filled with one of two assets {a₄ or a₇}. The implication is that the similarities of the candidates in the second slot of section d=1 have values that can be used to select an appropriate asset to pair with the anchor asset for section d=1 and slot r=2 given that the anchor asset is selected.

Following through with the examples in FIGS. 9-11 and Table 6, the value of pairwise

TABLE 6 Example Candidate Assets after Preparation for Ensemble Construction ssfID additive Eligible Asset Multiplier Value Section (d) slot (r) (j) p (d, r) p (d, r) 1 1 1 1 0.10 1 1 2 1 0.10 1 2 4 1 0.05 1 2 7 1 0.01 2 1 5 1  0.10| 2 1 6 1 0.10 2 2 5 1 0.15 2 2 6 1 0.08 2 3 5 1 0.10 2 3 6 1 0.01 2 4 5 1 0.10 2 4 6 1 0.01

similarities, f_(i,j) ^((d,r))=s′_(i,j)×ϕ_(i,j) ^((d,r))×(1+ρ_(j) ^((d,r))), used to fill position r=2 assuming the a_(l) is selected as the anchor asset for section d=1 are (refer to FIG. 10 for the adjusted similarities):

f _(1,4) ^((1,2)) =s′ _(1,4) ×f(^(1,2)) _(1,4)×ρ₄ ^((1,2))=0×1×(1+0.05)=0 and

f _(1,7) ^((1,2)) =s′ _(1,7) ×f ^((1,2)) _(1,7)×ρ₇ ^((1,2)) =s1,7×0.2×1×(1+0.01)=s _(1,7)×0.202

2.4.2 Weak Ensemble Construction

In an embodiment, after L_(t) has been defined, the automated process for selecting assets for co-presentation of a weak ensemble works as follows for each of the D sections:

Step 1: Given the desired choice of duplication restrictions (local or global), identify the set of all possible and allowable arrangements, a^((d)), of assets from A that can be used to populate the display, L_(t) under the restriction that the anchor assets and their positioning in each section, a^((d)) _(i), is required.

Step 2. Select a*, the unique arrangement of a that maximizes

m(L _(t))=w ^((d)) as defined earlier by Equation 1

-   -   which is equivalent to, for each slot, choosing the eligible         candidate asset a_(j) for which the similarity between it and         the section's anchor asset a_(i) ^((d)) is maximized (Note that         in the weak ensemble method, a global depth restriction may not         be sensible). The arrangement of assets selected as a* are then         used to populate the display and can guarantee that every asset         selected for presentation in section d has a non-zero similarity         with the anchor asset in the section (local depth). In the         example discussed above, earlier, if a_(i) is selected as the         anchor asset and another asset meeting the ssfID and fileTypeID         restrictions are applied to slot r=2 and section d=1, then only         assets a₄ or a₇ can be selected to fill the position. Since a₇         has a larger value (0.202×s_(1,7) assuming that s_(1,7)>0) for         the final similarity computation f^((1,2)) _(1,j) than asset a₄         when paired with asset a_(i), then it would be chosen to fill         the d=1, r=2 position in the display.

2.4.3 Strong Ensemble Construction

In an embodiment, after L_(t) has been defined, the automated process for selecting assets for co-presentation of a strong ensemble works as follows for each of the d=1, . . . , D sections:

Step 1: Given the desired choice of duplication restrictions (local or global), identify the set of all possible and allowable arrangements, a^((d)), of assets from A that can be used to populate the display, L_(t) under the restriction that the anchor assets and their positioning in each section, a^((d)) _(i), is required.

Step 2: Some possible arrangements of a^((d)) are removed from eligibility as an acceptable solution for populating layout L_(t). Any arrangement containing a pair of assets such that the similarity f^((d)) _(j,k)=0 (j,k=1, . . . , n) within a section d is removed. This is equivalent to enforcing the restriction that an eligible asset need to not only have a non-zero similarity with the anchor asset, but also to every other asset that may be co-displayed in the same section (Since this step may result in an empty set of solutions, non-zero choices for ε_(δ), ε₀ and α_(i,j) can be set to small-but-positive values to ensure that some possible arrangements are available, albeit sub-optimal). For example, if a_(i) is selected as the anchor asset for a display with r=3 assets, then the other two assets, a_(j) and a_(k), selected to fill the other positions must not only be similar to a_(i), but must be similar to each other—that is, consideration must be given to the similarities among {a_(i),a_(j)}, {a_(i),a_(k)} and {a_(j),a_(k)}.

Step 3: If a global depth restriction is desired, then some possible arrangements of a^((d)) are removed from eligibility to be an acceptable solution for populating layout L_(t). Any arrangement containing a pair of assets such that the similarity f^((d)) _(j,k)=0 (j,k=1, . . . , n) is removed. This is equivalent to enforcing the restriction that an eligible asset need to not only have a non-zero similarity with the anchor asset, but also to every other asset that will be co-displayed across all sections d=1, . . . , D.

Step 4: For each, select a*, the unique arrangement of a that maximizes m(L_(t))=w^((d))+λ_(t) b^((d)) given the desired depth (via appropriate selection of λ).

The arrangement of assets selected as a* may then be used to populate the display and can guarantee that every pair of assets within a section d has a non-zero similarity with every other asset selected for co-display in the section (local depth). If a global depth restriction is applied, then there will be an additional guarantee that every pair of assets within the display L_(t) has a non-zero similarity with every other asset selected for co-display in the display.

2.4.4 Semi-Strong Ensemble Construction

In an embodiment, after L_(o) has been defined, the automated process for selecting assets for co-presentation of a semi-strong ensemble can be executed. The semi-strong approach is performed iteratively to strike a compromise between the weak and strong ensemble approaches in terms of computational efficiency. The key difference in this approach relative to the other ensemble-building steps is that the final arrangement of assets, a*, is not necessarily the optimal one (that would be selected using all possible arrangements of eligible assets using the strong ensemble approach) but rather seeks to iteratively find the best asset for display given all assets chosen in earlier iterations.

Step 1: Some possible elements of a^((d)) are removed from eligibility as an acceptable solution for populating layout L_(t). Any element of F_(t) containing a pair of assets such that the similarity f^((d)) _(j,k)=0 (j,k=1, . . . , n) within a section d is removed. This is equivalent to enforcing the restriction that an eligible asset need to not only have a non-zero similarity with the anchor asset, but also to every other asset that will be co-displayed in the same section (Since this step may result in an empty set of solutions, non-zero choices for ε_(δ), ε₀ and α_(i,j) can be set to small-but-positive values to ensure that some possible arrangements are available, albeit sub-optimal).

Step 2: If a global depth restriction is desired, then some elements of a^((d)) are removed from eligibility to be an acceptable solution for populating layout L_(t). Any element of Ft containing a pair of assets such that the similarity f^((d)) _(j,k)=0 (j,k=1, . . . , n) is removed. This is equivalent to enforcing the restriction that an eligible asset need to not only have a non-zero similarity with the anchor asset, but also to every other asset that will be co-displayed across all sections d=1, . . . , D.

Step 3: Among remaining elements of Ft that remain, select a*, the unique arrangement of a that maximizes the objective function m(L_(t)) is computed. In sequential order, iterate over each section d=1, . . . , D and each slot within a section r=1, . . . , R_(d)—skipping over each anchor asset a_(i) ^((d)) since its slot choice is predetermined and set by L_(t) as follows:

Step 3a: To yield an ordered vector of similarities representing the assets that may be placed in the current (the r^(th)) position of section d, select the asset a_(j) ^((d,r)) eligible to be placed in that position (based on Ft) such that the similarity between that asset and all other assets previously identified to fill slots in the iterative process is non-zero. Additionally, among the assets with non-zero similarities and with local depth, compute

${w^{(d)} = {\min\limits_{j,k}f_{j,k}^{(d)}j}},{k = {1\ldots\left( {r - 1} \right)}},{j \neq k}$

-   -   and, for the a^((d,r)) element of a*, choose the value which         maximizes w^((d)) for which duplicates have been omitted         according to the local or global choice made for duplicate         values-.

Step 3b: If global depth is required, also compute:

b ^((d))=min{f _(j,k) ^((d)) ,f _(j,k) ^((g)) } j,k=1 . . . (r−1)∪(a _(i) ⁽¹⁾ ,a _(i) ⁽²⁾ , . . . , a _(i) ^((D)))},j≠k,g≤d

-   -   and select the asset a_(j) to fill slot r in section d with the         asset that maximizes

m(L _(t))=w ^((d))+λ_(t) b ^((d)) where 0≤λ_(t)≤1

-   -   during the current iteration and taking into account all other         assets that have been previously selected to join a* in previous         iterations.

The arrangement of assets selected as a* are then used to populate the display and, though may not be the global optimum, are considered to be optimal at the time during the iterative process when each asset was selected. These steps also ensure that depth and duplication requirements are considered when choosing assets for inclusion in a*.

In an example of how the semi-strong example would work for a given section d where the number of slots R_(d)=3 the local depth approach is used so the objective within the section is to maximize the value of m_(t)(L_(t)) in the section's ensemble. Initially, the first slot r=1 is populated with the anchor asset, a_(i), so a*^((d,1))=a_(i). Moving on to the next slot, r+1=2, the asset, a_(j), is selected having the largest similarity to the asset in slot 1, a*(^(d,1)). Once selected, the optimal set now contains a^(*(d))={a_(i), a_(j)}. In the next iteration, the selected asset, a_(k), is the one taken over all candidates for which the smallest of similarities f_(i,k) f_(i,k) is maximized. That is, the measure of similarity of an asset consideration to join the optimal set, a*(d) is based on a comparison to all existing members of the set established in prior iterations.

2.4.5 Improved Semi-Strong Ensemble Construction

Knowing that the semi-strong ensemble construction might not be the optimal solution—through constructed optimization considered at each iteration over the d=1, . . . D and r=1, . . . , R_(d) possible slot—it may be possible to review the results of the resulting semi-strong solution a* and seek improvements. To construct the improved semi-strong ensembles for the layout L_(t), it is of course required that semi-strong iterations first be conducted and a* established. The improvement process is fairly simple and based on a second set of iterations over each section d=1, . . . , D and each slot within a section r=1, . . . , R_(d)— skipping over each anchor asset a_(i) ^((d)) since their slot choice is predetermined and set by L_(t)—as follows:

Step 1: Define a_(j)) as the optimal set, excluding the asset a_(j) filling the (d,r)th position of a* “(corresponding to the current iteration);

a ^((−i)) ∩a*=Ø,a*=a* ^((−i)) ∪a*.

Step 2: Substitute another candidate asset a_(k)≠a_(j) that is eligible to fill the a^((d,r)) position of the optimal ensemble given duplicate and depth restrictions. Recompute:

m(L _(t))=w ^((d))+λ_(t) b ^((d)) where 0≤λ_(t)≤1

-   -   to obtain the value that would result from the substitution.

Step 3: If the objective function m(L_(t)) is improved based on the substitution, replace the appropriate value of a* with the substitution.

The net result is that the final layout reflects a positioning and ensemble a* that yields a value of the objective function m(L_(t)) that is the equivalent or more acceptable than the semi-strong ensemble that existed prior to the improvement process. Additionally, if substitutions do exist, the improvement process forces the final ensemble a* to be more similar to the one that would have existed if the computationally-costly strong ensemble method had been employed.

2.5 Layout Comparison

Although a single layout, L_(t) has been discussed, embodiments can be used to evaluate the performance of various layouts and choices of depth, duplicate asset restrictions, and ensemble construction. In an online environment, the various inputs and asset portfolio manager choices may be varied to determine if an alternative to a single layout might be more effective. If the asset portfolio manager has a known and measurable performance metric in mind (for example, maximize revenue or engagement time), he/she may then use an embodiment to compare the performance of various presentation layouts/requirements in a multivariate or A/B testing environment to identify how various layouts might improve performance. If A is the asset library available to the portfolio manager, L={L_(o), L₁, . . . , L_(m)} and R is a set of rules encompassing the choice of ensemble strength, duplicates, and down-weighted asset pairs, and λ is the depth parameter, then the item portfolio may evaluate m(L_(t)|R_(t),A, λ_(t)) for various combinations of R_(t), λ_(t) to determine the set of inputs that yields a maximum value of the performance metric.

It should be noted that the execution of this series of steps may be computed by a single actor (the asset portfolio manager) based at single physical location or multiple actors in single or multiple physical locations (FIG. 13 ).

An embodiment can use a version discussed above to maximize the objective function m(L_(t)|R_(t), A, λ_(t)) for any set of values. The embodiment can perform:

-   -   1) video recommendations in an ordered playlist (a         single-section layout) having a fixed number of slots (as         determined by the client). The system uses a single         sense/stationarity combination (sight/non-stationary) and allows         any video type that is playable by the client software. The         system may prohibit global duplication, uses a global depth         requirement, down-weights asset pairs probability of         co-presentation using client-provided business rules, and         employs the semi-strong ensemble-building method.     -   2) article-to-video layouts that take a text-based asset (a news         article) as the anchor asset in each section and provides video         recommendations in an ordered playlist (a single-section layout)         having a fixed number of slots (as determined by the client) to         accompany the text. The system uses a single sense/stationarity         combination (sight/non-stationary) and allows any video type         that is playable by the client software and selects video         pairings that have the highest similarity to the anchor asset         and other assets in the ensemble using the semi-strong ensemble         approach.     -   3) Multivariate and A/B testing can be performed to determine if         the weak ensemble and semi-strong ensemble methods yield similar         results and, if so, using the weak ensemble method to minimize         computational resources needed to support the presentation.     -   4) Recommendation engine—a recommendation engine can be         implemented that determines how similarities are computed and         feeds the ensemble-building.     -   5) Allows a continuous-play feature that seamlessly builds a         sequential set of videos that may be viewed without         interruption. The feature allows an entire video playlist to be         constructed and consumed and, should user interaction suggest         that a particular video asset consumption has terminated,         present a revised list of recommended assets presented as an         ensemble.

In an embodiment, an apparatus comprises a processor and is configured to perform any of the foregoing methods.

In an embodiment, a non-transitory computer readable storage medium, storing software instructions, which when executed by one or more processors cause performance of any of the foregoing methods.

Note that, although separate embodiments are discussed herein, any combination of embodiments and/or partial embodiments discussed herein may be combined to form further embodiments.

3.0. EXAMPLE EMBODIMENTS

Examples of some embodiments are provided, without limitation, in the following paragraphs.

According to some embodiments, a method or non-transitory computer readable medium comprises: receiving a specification of an asset to be displayed during an online engagement; selecting a variety of assets related to the asset to be displayed during the online engagement, the variety of assets including any combination of: audio, video, text, or image, wherein the variety of assets are selected based on a relationship with the asset to be displayed during the online engagement; arranging the selected variety of assets into a single presentation screen; sending the presentation screen to a receiving device for display.

According to some embodiments, an apparatus comprises: a specification receiver that receives a specification of an asset to be displayed during an online engagement; an asset selector that selects a variety of assets related to the asset to be displayed during the online engagement, the variety of assets including any combination of: audio, video, text, or image, wherein the variety of assets are selected based on a relationship with the asset to be displayed during the online engagement; a presentation screen creator that arranges the selected variety of assets into a single presentation screen; a presentation screen sender that sends the presentation screen to a receiving device for display.

In some of the above embodiments, the single presentation screen is arranged using display parameters of the receiving device.

In some of the above embodiments, the variety of assets are selected based on a relationship between each asset among the variety of assets and the asset to be displayed during the online engagement.

In some of the above embodiments, each asset among the variety of assets is selected based on a relationship between the asset and the asset to be displayed during the online engagement, and wherein the asset is selected based on a relationship between the asset and all other assets in the variety of assets.

In some of the above embodiments, each asset among the variety of assets is selected based on a relationship between the asset and each asset among the variety of assets that was selected previous to the asset.

In some of the above embodiments, the asset to be displayed during the online engagement is textual.

In some of the above embodiments, the variety of assets are selected based on a common theme of the variety of assets.

In some of the above embodiments, each asset among the variety of assets is selected based on a variable probability.

In some of the above embodiments, at least a portion of the variety of assets are selected and arranged in the presentation screen based on human senses.

In some of the embodiments, the single presentation screen is arranged using display parameters of the receiving device.

In some of the above embodiments, the variety of assets are selected based on a relationship between each asset among the variety of assets and the asset to be displayed during the online engagement.

Other examples of these and other embodiments are found throughout this disclosure.

4.0 IMPLEMENTATION MECHANISMS—HARDWARE OVERVIEW

According to an embodiment, the techniques described herein are implemented by one or more special-purpose computing devices. The special-purpose computing devices may be hard-wired to perform the techniques, or may include digital electronic devices such as one or more application-specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs) that are persistently programmed to perform the techniques, or may include one or more general purpose hardware processors programmed to perform the techniques pursuant to program instructions in firmware, memory, other storage, or a combination. Such special-purpose computing devices may also combine custom hard-wired logic, ASICs, or FPGAs with custom programming to accomplish the techniques. The special-purpose computing devices may be desktop computer systems, portable computer systems, handheld devices, networking devices or any other device that incorporates hard-wired and/or program logic to implement the techniques.

For example, FIG. 14 is a block diagram that illustrates a computer system 1400 upon which an embodiment may be implemented. Computer system 1400 includes a bus 1402 or other communication mechanism for communicating information, and a hardware processor 1404 coupled with bus 1402 for processing information. Hardware processor 1404 may be, for example, a general purpose microprocessor.

Computer system 1400 also includes a main memory 1406, such as a random access memory (RAM) or other dynamic storage device, coupled to bus 1402 for storing information and instructions to be executed by processor 1404. Main memory 1406 also may be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 1404. Such instructions, when stored in non-transitory storage media accessible to processor 1404, render computer system 1400 into a special-purpose machine that is customized to perform the operations specified in the instructions.

Computer system 1400 further includes a read only memory (ROM) 1408 or other static storage device coupled to bus 1402 for storing static information and instructions for processor 1404. A storage device 1410, such as a magnetic disk, optical disk, or solid-state drive is provided and coupled to bus 1402 for storing information and instructions.

Computer system 1400 may be coupled via bus 1402 to a display 1412, such as a cathode ray tube (CRT), for displaying information to a computer user. An input device 1414, including alphanumeric and other keys, is coupled to bus 1402 for communicating information and command selections to processor 1404. Another type of user input device is cursor control 1416, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 1404 and for controlling cursor movement on display 1412. This input device typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify positions in a plane.

Computer system 1400 may implement the techniques described herein using customized hard-wired logic, one or more ASICs or FPGAs, firmware and/or program logic which in combination with the computer system causes or programs computer system 1400 to be a special-purpose machine. According to one embodiment, the techniques herein are performed by computer system 1400 in response to processor 1404 executing one or more sequences of one or more instructions contained in main memory 1406. Such instructions may be read into main memory 1406 from another storage medium, such as storage device 1410. Execution of the sequences of instructions contained in main memory 1406 causes processor 1404 to perform the process steps described herein. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions.

The term “storage media” as used herein refers to any non-transitory media that store data and/or instructions that cause a machine to operate in a specific fashion. Such storage media may comprise non-volatile media and/or volatile media. Non-volatile media includes, for example, optical disks, magnetic disks, or solid-state drives, such as storage device 1410. Volatile media includes dynamic memory, such as main memory 1406. Common forms of storage media include, for example, a floppy disk, a flexible disk, hard disk, solid-state drive, magnetic tape, or any other magnetic data storage medium, a CD-ROM, any other optical data storage medium, any physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, NVRAM, any other memory chip or cartridge.

Storage media is distinct from but may be used in conjunction with transmission media. Transmission media participates in transferring information between storage media. For example, transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise bus 1402. Transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications.

Various forms of media may be involved in carrying one or more sequences of one or more instructions to processor 1404 for execution. For example, the instructions may initially be carried on a magnetic disk or solid-state drive of a remote computer. The remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem. A modem local to computer system 1400 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal. An infra-red detector can receive the data carried in the infra-red signal and appropriate circuitry can place the data on bus 1402. Bus 1402 carries the data to main memory 1406, from which processor 1404 retrieves and executes the instructions. The instructions received by main memory 1406 may optionally be stored on storage device 1410 either before or after execution by processor 1404.

Computer system 1400 also includes a communication interface 1418 coupled to bus 1402. Communication interface 1418 provides a two-way data communication coupling to a network link 1420 that is connected to a local network 1422. For example, communication interface 1418 may be an integrated services digital network (ISDN) card, cable modem, satellite modem, or a modem to provide a data communication connection to a corresponding type of telephone line. As another example, communication interface 1418 may be a local area network (LAN) card to provide a data communication connection to a compatible LAN. Wireless links may also be implemented. In any such implementation, communication interface 1418 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information.

Network link 1420 typically provides data communication through one or more networks to other data devices. For example, network link 1420 may provide a connection through local network 1422 to a host computer 1424 or to data equipment operated by an Internet Service Provider (ISP) 1426. ISP 1426 in turn provides data communication services through the world wide packet data communication network now commonly referred to as the “Internet” 1428. Local network 1422 and Internet 1428 both use electrical, electromagnetic or optical signals that carry digital data streams. The signals through the various networks and the signals on network link 1420 and through communication interface 1418, which carry the digital data to and from computer system 1400, are example forms of transmission media.

Computer system 1400 can send messages and receive data, including program code, through the network(s), network link 1420 and communication interface 1418. In the Internet example, a server 1430 might transmit a requested code for an application program through Internet 1428, ISP 1426, local network 1422 and communication interface 1418.

The received code may be executed by processor 1404 as it is received, and/or stored in storage device 1410, or other non-volatile storage for later execution.

5.0 EXTENSIONS AND ALTERNATIVES

In the foregoing specification, embodiments have been described with reference to numerous specific details that may vary from implementation to implementation. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense. The sole and exclusive indicator of the scope of the embodiments, and what is intended by the applicants to be the scope of the embodiments, is the literal and equivalent scope of the set of claims that issue from this application, in the specific form in which such claims issue, including any subsequent correction. 

What is claimed is:
 1. A computer-implemented method comprising: determining, by one or more computing processors and based at least partially on user interaction data collected by a user computing device operated by a user, a selected anchor media content item is to be rendered next with the user computing device to the user; selecting, by the one or more computing processors and based at least in part on pairwise comparisons with one or more features of the selected anchor media content item, one or more additional media content items from a plurality of media content items related to the selected anchor media content item, wherein the one or more features of the selected anchor media content item include a specific feature identifying whether the selected anchor media content item is a non-video file, wherein a similarity matrix comprising matrix values generated from the pairwise comparisons is used to select the one or more additional media content items for co-displaying with the anchor media content item on an image display of the user computing device; generating, by the one or more computing processors, a computer display page by arranging the anchor media content item and one or more additional media content items into the computer display page; causing, by the one or more computer processors, the computer display page to be sent to the user computer device for display on the image display of the user computing device.
 2. The method of claim 1, wherein the anchor media content item and one or more additional media content items displayed on the computer display page are arranged using the display parameters of the user computing device.
 3. The method of claim 1, wherein the one or more additional media content items are selected based on a relationship between each media content item among the one or more additional media content items and the anchor media content item.
 4. The method of claim 1, wherein each media content item among the one or more additional media content items is selected based on a relationship between said each media content item and each other media content item among the one or more additional media content items.
 5. The method of claim 1, wherein the anchor media content item and the one or more additional media content items include at least one media content item that is textual and at least another media content item that is non-textual.
 6. The method of claim 1, wherein the one or more additional media content items are selected based on a common theme.
 7. The method of claim 1, wherein at least a portion of the one or more additional media content items is selected and arranged in the electronic display page based on human sensing modes.
 8. A system, comprising: one or more computing processors; and one or more non-transitory computer-readable media storing instructions, which when executed by the one or more computing processors, cause the one or more computing processors to perform: determining, by the one or more computing processors and based at least partially on user interaction data collected by a user computing device operated by a user, a selected anchor media content item is to be rendered next with the user computing device to the user; selecting, by the one or more computing processors and based at least in part on pairwise comparisons with one or more features of the selected anchor media content item, one or more additional media content items from a plurality of media content items related to the selected anchor media content item, wherein the one or more features of the selected anchor media content item include a specific feature identifying whether the selected anchor media content item is a non-video file, wherein a similarity matrix comprising matrix values generated from the pairwise comparisons is used to select the one or more additional media content items for co-displaying with the anchor media content item on an image display of the user computing device; generating, by the one or more computing processors, a computer display page by arranging the anchor media content item and one or more additional media content items into the computer display page; causing, by the one or more computer processors, the computer display page to be sent to the user computer device for display on the image display of the user computing device.
 9. The system of claim 8, wherein the anchor media content item and one or more additional media content items displayed on the computer display page are arranged using the display parameters of the user computing device.
 10. The system of claim 8, wherein the one or more additional media content items are selected based on a relationship between each media content item among the one or more additional media content items and the anchor media content item.
 11. The system of claim 8, wherein each media content item among the one or more additional media content items is selected based on a relationship between said each media content item and each other media content item among the one or more additional media content items.
 12. The system of claim 8, wherein the anchor media content item and the one or more additional media content items include at least one media content item that is textual and at least another media content item that is non-textual.
 13. The system of claim 8, wherein the one or more additional media content items are selected based on a common theme.
 14. The system of claim 8, wherein at least a portion of the one or more additional media content items is selected and arranged in the electronic display page based on human sensing modes.
 15. One or more non-transitory computer-readable media storing one or more sequences of instructions, which when executed by one or more computing processors, cause the one or more computing processors to perform: determining, by the one or more computing processors and based at least partially on user interaction data collected by a user computing device operated by a user, a selected anchor media content item is to be rendered next with the user computing device to the user; selecting, by the one or more computing processors and based at least in part on pairwise comparisons with one or more features of the selected anchor media content item, one or more additional media content items from a plurality of media content items related to the selected anchor media content item, wherein the one or more features of the selected anchor media content item include a specific feature identifying whether the selected anchor media content item is a non-video file, wherein a similarity matrix comprising matrix values generated from the pairwise comparisons is used to select the one or more additional media content items for co-displaying with the anchor media content item on an image display of the user computing device; generating, by the one or more computing processors, a computer display page by arranging the anchor media content item and one or more additional media content items into the computer display page; causing, by the one or more computer processors, the computer display page to be sent to the user computer device for display on the image display of the user computing device.
 16. The media of claim 15, wherein the anchor media content item and one or more additional media content items displayed on the computer display page are arranged using the display parameters of the user computing device.
 17. The media of claim 15, wherein the one or more additional media content items are selected based on a relationship between each media content item among the one or more additional media content items and the anchor media content item.
 18. The media of claim 15, wherein each media content item among the one or more additional media content items is selected based on a relationship between said each media content item and each other media content item among the one or more additional media content items.
 19. The media of claim 15, wherein the anchor media content item and the one or more additional media content items include at least one media content item that is textual and at least another media content item that is non-textual.
 20. The media of claim 15, wherein the one or more additional media content items are selected based on a common theme.
 21. The media of claim 15, wherein at least a portion of the one or more additional media content items is selected and arranged in the electronic display page based on human sensing modes. 